Between Imitation and Intention Learning
Authors
Abstract
Research in learning from demonstration can generally be grouped into imitation learning and intention learning. In imitation learning, the goal is to reproduce the observed behavior of an expert, typically using supervised learning techniques. In intention learning, the goal is to learn the intention that motivated the expert’s behavior and then use a planning algorithm to derive behavior. Imitation learning has the advantage of learning a direct mapping from states to actions, which carries a small computational cost. Intention learning has the advantage of behaving well in novel states, but may carry a large computational cost because it relies on planning algorithms in complex tasks. In this work, we introduce receding horizon inverse reinforcement learning (IRL), in which the planning horizon induces a continuum between these two learning paradigms. We present empirical results on multiple domains demonstrating that performing IRL with a small, but non-zero, receding planning horizon greatly decreases the computational cost of planning while maintaining superior generalization performance compared to imitation learning.
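To make the role of the planning horizon concrete, the following is a minimal sketch of receding-horizon action selection on a toy problem. It is not the authors' implementation: the chain world, the `reward` function (a stand-in for a reward that IRL would learn), the discount `GAMMA`, and all function names are assumptions chosen for illustration. With a horizon of 0 the agent acts on the immediate reward of each state-action pair, which resembles a direct state-to-action mapping; larger horizons add lookahead at a higher planning cost.

```python
# Hypothetical 1-D chain world: states 0..N_STATES-1, actions move left or right.
N_STATES = 6
ACTIONS = (-1, +1)
GAMMA = 0.95  # assumed discount factor


def step(state, action):
    """Deterministic transition, clipped at both ends of the chain."""
    return min(max(state + action, 0), N_STATES - 1)


def reward(state, action):
    """Stand-in for a learned reward: 1 when the action lands in the last state."""
    return 1.0 if step(state, action) == N_STATES - 1 else 0.0


def q_value(state, action, horizon):
    """Finite-horizon Q-value using `horizon` steps of lookahead.

    With horizon == 0 this is just the immediate reward. Without caching,
    the cost grows as len(ACTIONS) ** horizon.
    """
    r = reward(state, action)
    if horizon == 0:
        return r
    nxt = step(state, action)
    return r + GAMMA * max(q_value(nxt, a, horizon - 1) for a in ACTIONS)


def receding_horizon_action(state, horizon):
    """Replan from the current state and return the best first action."""
    return max(ACTIONS, key=lambda a: q_value(state, a, horizon))
```

In this toy setting, a horizon of 0 only distinguishes actions next to the rewarding state, while a horizon of 4 is enough to see the reward from state 0. The agent would call `receding_horizon_action` again at every step, so the horizon "recedes" as it moves.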
Similar resources
An Empirical Characterization of Parsimonious Intention Inference for Cognitive-level Imitation Learning
Imitation learning is a promising route to better collaboration between humans and artificial agents. It will be most effective if the agent has some cognitive-level “understanding” of a human demonstrator’s intentions. Inferring intent is an example of abductive reasoning, wherein an agent explains the available evidence based on causal knowledge. Good explanations should satisfy some notion o...
Imitation
Imitation, the ability to recognize and reproduce others’ actions, is a powerful means of learning and developing new skills. Species endowed with this capability are provided with fundamental abilities for social learning. In its most complex form, imitation provides fundamental capabilities for social cognition, such as the recognition of conspecifics, the attribution of others’ ...
Active Imitation Learning of Hierarchical Policies
In this paper, we study the problem of imitation learning of hierarchical policies from demonstrations. The main difficulty in learning hierarchical policies by imitation is that the high level intention structure of the policy, which is often critical for understanding the demonstration, is unobserved. We formulate this problem as active learning of Probabilistic State-Dependent Grammars (PSDG...
Goal-directed imitation for robots: A bio-inspired approach to action understanding and skill learning
In this paper we present a robot control architecture for learning by imitation which takes inspiration from recent discoveries in action observation/execution experiments with humans and other primates. The architecture implements two basic processing principles: 1) imitation is primarily directed toward reproducing the goal/end state of an observed action sequence, and 2) the required capacit...
A Novel Parsimonious Cause-Effect Reasoning Algorithm for Robot Imitation and Plan Recognition
Manually programming robots is difficult, impeding more widespread use of robotic systems. In response, efforts are being made to develop robots that use imitation learning. With such systems a robot learns by watching humans perform tasks. However, most imitation learning systems replicate a demonstrator’s actions rather than obtaining a deeper understanding of why those actions occurred. Here...